String Kernel Approach for Efficient Extraction of Medical Relations
ثبت نشده
چکیده
This paper presents a methodology for building an application that can classify healthcare information. It extracts informative sentences from medical papers that mentions about diseases and treatments, and then recognizes semantic relations that exist between the entities in the informative sentences. Support Vector Machine algorithm with String Kernel is used for relation identification. The proposed system avoids unnecessary information and gives the user disease and Treatment related sentences from medical pages. Evaluation results for this approach show that the proposed methodology obtains reliable results. The string kernel approach is also compared with naïve bayes approach and it was found that string kernel approach outperforms naïve bayes method for classification. This technique can be integrated with any medical management system to make good decisions and in patient management system by automatically mining the biomedical information from digital repositories. This system enables easy access to medical information in rural areas where there is a relative shortage of physicians. General Terms Classification, Machine learning.
منابع مشابه
Extraction of Drug-Drug Interaction from Literature through Detecting Linguistic-based Negation and Clause Dependency
Extracting biomedical relations such as drug-drug interaction (DDI) from text is an important task in biomedical NLP. Due to the large number of complex sentences in biomedical literature, researchers have employed some sentence simplification techniques to improve the performance of the relation extraction methods. However, due to difficulty of the task, there is no noteworthy improvement in t...
متن کاملA New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model
Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...
متن کاملCharacter based String Kernels for Bio-Entity Relation Detection
Extracting bio-entity relations has emerged as an important task due to the ever-growing number of bio-medical documents. In this paper, we present a simple and novel representation for extracting bio-entity relationships. The state-of-theart systems for such tasks rely on word based representations and variations of linguistic driven features. In contrast, we model bio-text by the most basic c...
متن کاملSemi-supervised Abstraction-Augmented String Kernel for Multi-level Bio-Relation Extraction
Bio-relation extraction (bRE), an important goal in bio-text mining, involves subtasks identifying relationships between bio-entities in text at multiple levels, e.g., at the article, sentence or relation level. A key limitation of current bRE systems is that they are restricted by the availability of annotated corpora. In this work we introduce a semisupervised approach that can tackle multi-l...
متن کاملThe Spectrum Kernel: A String Kernel for SVM Protein Classification
We introduce a new sequence-similarity kernel, the spectrum kernel, for use with support vector machines (SVMs) in a discriminative approach to the protein classification problem. Our kernel is conceptually simple and efficient to compute and, in experiments on the SCOP database, performs well in comparison with state-of-the-art methods for homology detection. Moreover, our method produces an S...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015